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SUPEROXIDE DISMUTASE CLONING AND EXPRESSION IN MICROORGANISMS 

This application is a continuation-in-part of Serial Number 538,607, filed October 3, 1983. 
DESCRIPTION OF SPECIFIC EMBODIMENTS 

5 



Field of the Invention 

to Superoxide dismutase ("SOD") is in fact a variety of different enzymes found in most living organisms. 
One function in mammals is to destroy superoxide, a material naturally produced during phagocytosis. The 
superoxide dismutases are characterized in families based on the metal associated with the enzyme, where 
the metals vary amongst iron, manganese, copper and copper-zinc. Superoxide dismutase, e.g., from 
bovine liver, has found clinical use. particularly as an anti-inflammatory agent in mammals including 

is humans. Other utilities include scavenging superoxide anions due to exposure of a host to various 
superoxide-inducing agents, e.g. radiation, 'paraquat, etc.; prophylaxis or therapy for certain, degenerative 
diseases, e.g.. emphysema; food preservation; and the like. 

It is therefore important that stable supplies of physiologically acceptable superoxide dismutase be 
made available, particularly for use in vivo as an anti-inflammatory agent or for other therapeutic purposes. 

20 For human application it would be preferable to employ the homologous enzyme to prevent or minimize 
possible immune response. By employing recombinant DNA techniques, there is the opportunity to produce 
products efficiently, which have the desired biological activities of superoxide dismutase, such as im- 
munological and enzymatic activities. 

25 

Description of the Prior Art 

The amino acid sequence of human erythrocyte Cu-Zn superoxide dismutase is described in Jabusch 
et al., Biochemistry (1980) 19:2310-2316 and Barra et al., FEBS Letters (1980) 1 20:53-55. Bovine eryth- 
30 rocyte Cu-Zn SOD is described by Steinman et al.. J. Biol. Chem : (1974) 249:7326-7338. A SOD-1 cDNA 
clone is described by Lieman-Hurwitz et al.. Proc. Natl. Acad . Set. USA (1982) 79:2808-2811. Concerning 
the effect on efficiency of translation of varying the untranslated region upstream from the initiation codon, 
see Gheysen et al.. Gene (1982) _t7:55-63; Thummel et al.. J. Virol . (1981) 37:683-697; and Matteucci and 
Heyneker. NucT Acids~Res . (1 983) 1Jj31 1 3-31 21 . 

SUMMARY OF THE INVENTION 



40 Efficient production of polypeptides demonstrating the biological activity of human Cu-Zn superoxide 
dismutase is demonstrated by the preparation of cDNA of the major portion of the structural gene, linking to 
a mixture of adapters providing for varying sequences extending from the ribosomal binding site to 
degenerate nucleotides in the coding region, and insertion of the complete gene with its translational signals 
into an expression vector. Transformation of microorganisms results in efficient production of a competent 

45 polypeptide demonstrating biological .activity of human Cu-Zn superoxide dismutase. The gene may be 
further used for combining with secretory and processing signals for secretion in an appropriate host. 

Novel protocols are provided for enhancing expression of a polypeptide involving the use of mixtures of 
adapters having varying sequences flanking the initiation site for translation, i.e., in the region between the 
ribosomal binding site and translational initiation site and in the initial several 5 -codons of the polypeptide, 

so where permitted by redundancy constraints of the genetic code. 

Polypeptides acetylated at their N-terminus and methods for producing such acetylated polypeptides 
are also provided. By providing a particular acetylation signal sequence at the 5 -end of the structural gene 
for a desired polypeptide, the N-termina! amino acid will be acetylated when the gene is expressed in 
yeast. The acetylation signal sequence encodes for at least the first two N-terminal amino acids, where the 
first amino acid is either alanine or glycine, and the second amino acid is a polar amino acid, usually being 
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threonine, serine or aspartate. Acetylation of human superoxide dismutase produced in yeast is dem- 
onstrated when the first two amino acids are alanine and threonine, respectively. — - " 

BRIEF DESCRIPTION OF THE DRAWINGS 



Fig. 1 indicates the DNA linker sequence and a flow diagram showing its use; 

Figs. 2 and 3 are flow diagrams indicating the preparation of plot5/SOD. 
70 Fig. 4 indicates the sequence of both the coding strand of human SOD cDNA (5 —3') and the 

resultant translation product. 

Fig. 5 illustrates the sequence of the isolated human SOD gene described in the Experimental 
section hereinafter. 

Fig. 6 is a restriction map of the isolated human SOD gene described in the Experimental section 
75 hereinafter. 

DESCRIPTION OF SPECIFIC EMBODIMENTS 

20 . 

Methods and compositions are provided for the efficient expression of polypeptides demonstrating the 
biological activities of human Cu-Zn superoxide dismutase ("hSOD"). The methods employ a DNA 
sequence ("hSOD gene") encoding a substantial portion of the amino acid sequence of hSOD in 
conjunction with a translational initiation region optimized for expression in the expression host. The'^hSOD 

25 gene is inserted into an appropriate vector for expression in a host, conveniently under conditions which 
allow for secretion, so as to harvest the SOD product from the extracellular medium. 

Methods and compositions are also provided for the N-terminal acetylation of hSOD and other 
polypeptides. Hereinafter, acetylation refers to addition at the amino terminus of polypeptides and proteins 
in contrast to modification of amino acid side chains, e.g., lysine, as is also observed naturally. Acetylation 

30 of polypeptides and proteins is useful for a number of reasons. Where the natural condition of the 
polypeptide includes acetylation, as is the case for cytoplasmic hSOD, methods of expression which jnclude 
acetylation provide a product having the desired natural structure and conformation. Where the product 
finds pharmaceutical and/or in vitro or in vivo diagnostic use, the acetylated material will minimize or 
eliminate immunogenicity when administered to a host and/or exposed to biological samples.^ Also, 

35 acetylated polypeptides are likely to be more stable and resistant to degradation by proteases ana" thus 
enjoy a prolonged existance in the cell, blood or body and tissue fluids. 

The structural gene for hSOD or other polypeptide includes an acetylation signal sequence at the 5 - 
end thereof, which signal sequence causes a yeast expression host to effect actylation. The acetylation 
signal sequence encodes at least the first two N-terminal amino acids in the polypeptide. The first amino 

40 acid will be either alanine, glycine or serine, while the second amino acid will be a polar or aromatic amino 
acid, usually being threonine, serine, aspartate or phenylalanine. 

The amino acids may be the natural N-terminal amino acids normally present in the polypeptide to be 
expressed. This is the case with hSOD where the first two amino acids are alanine and threonine, 
respectively. Other naturally-acetylated proteins which may be expressed and acetylated in yeast include: 



45 





Protein 


Source 


Signal 
Sequence 




Cytochrome C 


Human. Rhesus Monkey, Dog, Horse, etc. 


GLY-ASP 


50 


Cytochrome C 


Castor, Sesame, Mung-bean, etc. 


ALA-SER 




Glutamate dehydrogenase 


Neurospora 


SER-ASN 




Calmodulin 


Pig 


SER-ALA 




Myosin (light chain A2) 




SER-PHE 




ADH 


Drosphila 


SER-PHE 


55 









The present invention is also useful for acetylating polypeptides and proteins which are not naturally 
acetylated. Acetylation may be achieved by joining the acetylation signal sequence to the 5 -end of the 
structural gene for the polypeptide. The acetylation signal sequence will encode for at least two amino acids 

3 
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(as described above), and may encode up to ten or more amino acids, preferably fewer than five amino 
acids. Fewer added amino acids is usually desirable to limit interference with or loss of a desired activity of 
the polypeptide. Conveniently, the signal sequence may be synthesized and joined to the structural gene 
using well known techniques. 

5 As an alternative to adding the acetylation signal sequence to the structural gene, it wilt sometimes be 

possible to modify the 5 -end of the structural gene to substitute one or both of the first two amino acids of 
the polypeptide. Such modification may be accomplished by a variety of conventional methods. For 
example, the structural gene may be restricted near its 5 -end to remove a known number of nucleotides. A 
synthetic oligonucleotide may then be joined to the cohesive end remaining after restriction. The 

to oligonucleotide will restore and substitute the base pairs as necessary to provide the desired acetylation 
signal sequence. Alternatively, site-specific mutagenesis employing, e.g., phage Ml 3. can be used to effect 
an appropriate modification to the 5 -end of the structural gene. 

In order to prepare hSOD, it is necessary to have a DNA sequence which encodes for hSOD. One 
manner of achieving such sequence, is to clone cDNA from messenger RNA from cells which produce 

75 hSOD. Conveniently, human liver cells may be used for this purpose. After the cDNA is cloned, where the 
DNA coding sequence is unknown, but at least a partial amino acid sequence is known, one may then 
screen the cDNA with mixtures of probes having all of the possible variations of nucleotides encoding for 
the particular series of amino acid residues. The choice of the residues for which the sequence encodes is 
somewhat arbitrary, although the residues chosen will usually be selected, to minimize the number of 

20 different sequences which must be synthesized. 

For hSOD. conveniently a DNA sequence encoding for at least the amino acid residues 19 to 24 can be 
used, particularly a probe having at least about 15 bases and not more than about 20 bases, more 
conveniently about 17 bases. One may then restriction enzyme digest the clones which appear to hybridize 
with the labeled probes, fractionate the DNA fragments and repeat the hybridization, particularly by 

25 employing a second series of probes which hybridize to DNA sequences encoding for a different series of 
amino acid residues in hSOD. Conveniently, these amino acid residues may be 109 to 114. One or more 
clones may be found which are positive to both probes and these may be used as a source for cDNA 
encoding for at least a substantial proportion of hSOD. 

Quite surprisingly, it was found that the amino acid sequences which have been published for hSOD 

30 differed in a significant number of residues from the amino acid sequence encoded for by the cDNA. 
Specifically, where the two published sequences differed (Jabusch et aL, Biochemistry (1980) 19:2310-2316 
and Barra et al.. FEBS Letters (1-980) 120:153-156), the correct assignments are: residue FT, aspartate; 
residue 1 7, isoleucine; residue 26, asparagine; residue 49. glutamate; residue 52, aspartate: residue 53, 
asparagine; residue 92, aspartate; residue 98, serine (see Fig. 4). 

35 Because of the uncertainties of the effect on translation of the separation between the ribosomal binding 
site and the translational initiation codon, normally AUG. the subject method provides a technique for 
varying the distance and nucleotides separating the ribosomal binding site from the initiation codon. 
Usually, there are from about 6 to 15, more usually about 6 to 12 nucleotides in the spacer between the 
ribosomal binding site and initiation codon. As the- base sequence downstream from the initiation site may 

40- also affect .'translation ' efficiency, the subject method also provides for variation of nucleotide sequence (but 
not length) within the initial several 5 -codons of the polypeptide as permitted by the redundancy constraints 
of the genetic code. Such degeneracy may intend up to 4 codons, more usually 2 codons, downstream 
from the initiation site. 

A plurality of linkers are prepared where at least 2 nucleotides, usually at least 3 nucleotides, and not 
45 more than 10 nucleotides, usually not more than about 6 nucleotides^ are varied to include members having 
each of the 4 nucleotides if within the spacer or 2, 3, or 4 nucleotides as permitted by genetic code 
redundancy if within the structural gene for the polypeptide. In addition, the linkers are prepared, having 
differing numbers of nucleotides, so as to provide a group of linkers differing not only in the sequence, but • 
also in length. The difference in length can be achieved by removal of portions of the support during the 
so (inker synthesis and, if appropriate, continuing synthesis at a subsequent stage, so as to provide for linkers 
having a graduated number of sequence lengths. Usually, the mixture of linkers will vary in length by at 
least one nucleotide and not more than over a range of six nucleotides, usually not more than four 
nucleotides. 

This can be conveniently illustrated where the absent bases are at the terminus. After each stage, a 
55 portion of the support is removed and the synthesis continued with the strands bound to the support, 
providing all four nucleotides (dNTP) at each stage. These single strands will then be hybridized to a single 
strand which is complementary in part, where the variable region will be an overhang. Thus, one will 
achieve a graduated series of linkers having overhangs differing in both their nucleotide sequences and 
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lengths. At an appropriate point during subsequent hybridization, ligation or cloning operations the overhang 
region's) is filled in to provide double-stranded material amenable to further manipulation. This is usually 
and preferably performed in vitro, e.g., using the Klenow fragment of DNA polymerase I: alternatively, in 
certain constructs the overhang could be cloned as a single strand with filling in occurring in vivo in the 
5 transformed or transfected host. Hybridization to a complementary strand can be achieved by having a 5 - 
sequence upstream from the variable nucleotide series which is complementary to a sequence present in 
the terminal sequence to which the linker is to be joined. The missing bases may then be filled in vitro or in 
vivo. 

The linkers include within their sequence, at least a portion of the region between the ribosomal binding 
70 site and the initiation codon, preferably the nucleotides proximal to the initiation codon. The linker may also 
include the initiation codon and portions of the structural gene, the ribosomal binding site, and bases 
upstream -from the ribosomal binding site, which may or may not include transcriptional regulatory 
sequences. 

Usually the linker will be at least about 5 bases, more usually at least about 20 bases, and usually not 

15 exceeding about 200 bases, more usually not exceeding about 100 bases. Where the linker is greater than 
about 35 bases, it will usually be assembled by employing single stranded sequences of from about 10 to 
35 bases, which have homology with only a part of a complementary strand, thus providing for complemen- 
tary overlapping sequences with overhangs, so that the various single strands can be hybridized, ligated ■ 
and the degenerate and/or variable length overhang filled in as indicated above to produce the desired; 

20 linker having cohesive and/or blunt ends. 

Where the structural gene has a convenient restriction site, usually not more than about 50 bases 
downstream from the initiation codon, a fragment containing the structural gene may be restricted and 
joined to a complementary cohesive terminus of the linker or may be filled in to provide a blunt-end 
terminus, which blunt end may be ligated to a blunt end of the linker. The linker is devised to ensure that 

25 the structural gene is complete and in reading frame with the initiation codon. 

As indicated, in preparing the linker, one provides that there are a series of linkers which, have a 
randomized series of nucleotides, that is, each of the four possible nucleotides in the coding strand (subject 
to the provision of genetic code limitations indicated above) and which are graduated in size, lacking:one or 
more of the nucleotides defining the region intermediate or bridging the ribosomal binding site and initiation 

30 codon. These linkers which are prepared from single strands may be joined to other single or double DNA 
strands to provide for extended linkers, which may include not only the ribosomal binding site, butvbases 
upstream from the ribosomal binding site. Alternatively, the linkers may be relatively small, beginning at a 
site internal to or adjacent to the ribosomal binding site and extending downstream to a site at the initiation 
codon or internal to the structural gene. s 

35 While the particular order of joining the various fragments to produce the constructs of this invention will 
usually not be critical, conveniently, the structural gene may be first joined to the linker. This DNA construct 
will include not only the structural gene, but also the ribosomal binding site and any additional nucleotides 
upstream from the ribosomal binding. site. In addition, there will be substantial variety in the nucleotides and 
numbers of nucleotides between the ribosomal binding site and initiation codon. The subject DNA construct 

40 is inserted into an appropriate expression vector which has the necessary transcriptional initiation regulatory 
sequences upstream, as well as transcriptional termination regulatory sequences downstream from the 
insertion site of the subject DNA construct. Thus, the linker will be flanked at the 5 -end with transcriptional 
initiation regulatory signal sequences and at the 3-end with at least a portion of a coding, region and 
transcriptional and translatronal termination sequences. (5 - and 3 -intend the direction of transcription.) 

45 After preparing the plasmid or viral DNA for introduction into an appropriate host {usually including at an 
appropriate stage in the manipulations filling in of the variable overhang region), the host is transformed or 
transfected, respectively, cloned, the clones streaked and individual clones selected for efficient expression 
by assaying for production of the desired product, e.g., hSOD. The number of clones to be screened to 
determine the various levels of production of the product will depend upon and be proportional to the 

so degreee of length variability and sequence degeneracy introduced into the synthetic linker. As exemplified 
in the present embodiment, with 4 length variables and 4- fold sequence degeneracy at each of 6 
nucleotides in the linker, the number of possible recombinant sequences is 5440. Usually at least a few 
hundred, preferably several thousand or more, clones will be screened. Screening can be efficiently 
performed using Western blots (antibody detection of product) of host cell colonies or viral plaques 

55 transferred to filters of nitrocellulase or other suitable material. Alternatively, using electrophoresis and 
providing for a plurality of lanes, where each lane is an individual clone, an immediate and direct 
comparison can be made of which clones are most efficient in expression by visualization of staining 
intensity, autoradiography or Western blotting of the product band. This screen will usually be sufficient, 
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although more quantitative immunoassays or enzyme assays can be employed, as appropriate. 

If desired, the construct can be transferred to a different host which recognizes the regulatory "signals of 
the expression construct or the expression construct modified by introduction at appropriate sites of 
necessary regulatory signals to provide for efficient expression in an alternative host. 

5 If desired, the hSOD gene may be joined to secretory leader and processing signals to provide for 

secretion and processing of the hSOD. Various secretory leader and processing signals have been 
described in the literature. See for example, U.S. Patent Nos. 4,336.336 and 4,338.397, as well as 
copending application Serial Nos. 522.909. filed August 12, 1983 and 488.857, filed April 26, 1983, the 
relevant portions of which are incorporated herein by reference. 

w Of particular interest as hosts are unicellular microorganism hosts, both prokaryotes and eukaryotes, 
such as bacteria, algae, fungi, etc. In particular, E. colt, B. subtilis , S. cerevisiae . Streptomyces . Neurospora 
may afford hosts. 

A wide variety of vectors are available for use in unicellular microorganisms, the vectors being derived 
from plasmids and viruses. The vectors may be single copy or low or high multicopy vectors. Vectors may 

/s serve for cloning and/or expression. In view of the ample literature concerning vectors, commercial 
availability of many vectors, and even manuals describing vectors and their restriction maps and char- 
acteristics, no extensive discussion is required here. As is well-known, the vectors normally involve markers 
allowing for selection, which markers may provide for cytotoxic agent resistance, prototrophy or immunity. 
Frequently, a plurality of markers are present, which provide for different characteristics. 

20 In addition to the markers, vectors will have a replication system and in the case of expression vectors, 
will usually include both the initiation and termination transcriptional regulatory signals, such as promoters, 
which may be single or multiple tandem promoters, an mRNA capping sequence, a TATA box, enhancers, 
terminator, polyadenylation sequence, and one or more stop codons associated with the terminator. For 
translation, there will frequently be a ribosomal binding site as well as one or more stop codons, although 

25 usually stop codons will be associated with a structural gene. Alternatively, these regulatory sequences may 
be present on a fragment containing the structural gene, which is inserted into the vector. 

Usually, there will be one or more restriction sites conveniently located for insertion of the structural 
gene into the expression vector. Once inserted, the expression vector containing the structural gene may be 
introduced into an appropriate host and the host cloned providing for efficient expression of hSOD. 

30 In some instances, specialized properties may be provided for the vector, such as temperature 
sensitivity of expression, operators or activators for regulation of transcription, and the like. Of particular 
interest is, the ability to control transcription - by exogenous means, such as temperature, ■ inducers, 
corepressors, etc., where transcription can be induced or repressed by an exogenous compound, usually 
organic. 

as Where the hSOD is made intracellular^, when the cell culture has reached a high density, the cells may 
be isolated, conveniently by centrifugation, lysed and the hSOD isolated by various techniques, such as 
extraction, affinity chromatography, electrophoresis, dialysis, or combinations thereof. Where the product is 
secreted, similar techniques may be employed with the nutrient medium, but the desired product will be a 
substantially higher proportion of total protein in the nutrient medium, than in the cell lysate. 

40 The hSOD which is formed has substantially the same amino acid sequence as the naturally occurring 
human superoxide dismutase, usually differing by fewer than 5 amino acids, more usually differing by fewer 
than 2 amino acids. The recombinant hSOD {"r-hSOO") displays substantially the same biological prop- 
erties as naturally occurring hSOD. The biological properties include imnunological properties, where 
antibodies raised to authentic hSOD cross-react with r-hSOD. Furthermore, in common bioassays employed 

45 for hSOD, the r-hSOD product demonstrates a substantial proportion, usually at least about 10%. preferably 
at least about 50%, more preferably at least about 80%, of the enzymatic activity of the authentic hSOO, 
based on weight of protein. An illustrative assay technique is described by Marklund and Marklund, Eur. J. 
Biochem . (1974) 47:469-474. 

The following examples are offered by way of illustration and not by way of limitation. 

50 

EXPERIMENTAL 



55 

Molecular Cloning of hSOO cDNA 

Total RNA was prepared from an adult human liver by the guanidinium thiocyanate/lithium chloride 
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method (Cathala et al., DNA (1983) 2:329-335). polyA RNA was used to synthesize double-stranded cDNA 
(Maniatis et al.. Molecular Cloning , 213-242, Cold Spring Harbor, 1982) and this was passed over a 
Sepharose CL4B column to enrich for cDNAs of greater than 350bp (Fiddes and Goodman, Nature (1979) 
281 :351-356). The cDNA was inserted at the Pstl site of plot4, a pBR322 derivative having the following 
s sequence replacing the Pstl- Eco RI site. 

Pst l Hln fl Alul 
1 GGTGAATCCGTAATCATGGTCATAGCTGTTTCCTCTGTGAAATTCTTATCCCCTC 
ACGTCCACTTACCCATTAGTACCAGTATCGACAAAGGACACACTTTAACAATAGGCCAG 
10 Hph I 

HlndllAluI 

60 ACAATTCCACACATTATACGAGCCGATCATTAATTGTCAACAGCTCATTTCAGAATATTT 
TGTTAAGGTGTGTAATATGCTCGGCTACTAATTAACAGTTGTCGAGTAAAGTCTTATAAA 

;s 

EcoRI 

120 GCCAGAACCGTTATGATGCGG 

CGGTCTTGGCAATACTACGCCTTAA 

20 

The cDNA insertion employed the oiigo-dG:dC tailing method (Maniatis et al., supra). E. coli strain 01210 
was transformed with this mixture and transformants selected on L-agaT containing KTug/ml tetracycline 
(Kushner. S.R. (1978) In: Genetic Engineering, eds. Boyer, H.B. and Nicosia, S., (Elsevier/North Holland, 
Amsterdam) p. 17). Plasmid DNA constituting a liver cDNA library was prepared (Maniatis et al.. Molecular 
25 Cloning , pp. 86-94. Cold Spring Harbor 1982) directly from- approximately 62,000 recombinant -colonies 
plated at a density of approximately 3,000 colonies per 9cm diameter Petri dish. 



Isolation of r-hSOD Clones 

30 • 

Strain D1 210 was retransformed with the liver cDNA library and about 40,000 clones were grown on 
nine 14cm diameter Petri dishes. After transfer of the colonies to nitrocellulose paper and chloramphenicol 
amplification of plasmid DNA, the cells were lysed and the filters prepared for hybridization (Ish-Horowicz 
and Burke, Nucleic Acids Research (1981) 9:2989-2998). Oligonucleotide probes were employed for 
35 screening by hybridization, with the probes consisting of enzymatically-radiolabeled, chemically-synthesized 
DNA molecules complementary to the mRNA encoding amino acid residues 19 to 24 of the protein 
(Jabusch et al.. supra .; Barra et al.. supra.); the mixture had the following sequences: 

3 1 TTA AAA CTT GTT TTT CT 5' 
40 G G C C C 

where ail of the indicated possibilities for encoding the peptide sequence were prepared (32-fold degen- 
erate). 

The probes were labeled with 32 P to a specific activity of 1 -3xl0 8 cpm/u,g and Millipore 0.45u.m) filtered 
45 before use. Filters were prehybridized for6hrs at 30 *C in 4x SSC, 2x Denhardts's solution, 40mM sodium 
phosphate, pH 7.5, 300ug/ml sonicated salmon testes DNA. Hybridization was for 20hrs at 30 * C in the 
same solution containing 2x10 6 cpm/ml hSOD DNA probe (residues 19-24). Filters were washed in 4x SSC, 
once for I5min at r.t. and twice for l5min at 30 C, blotted dry and autoradiographed with an. intensifying 
screen for 24hrs at -70* C. 

50 Areas on the master plates that corresponded to duplicate positive signals were picked into L-broth and 
plasmid DNA prepared by the miniscreen procedure (Maniatis et al.. Molecular Cloning. 178, 368-369. Cold 
Spring Harbor 1982).. This DNA was cut with Pstl and subjected~to*Southern blot analysis (Southern, J. Mol. 
Biol . (1975) 98:503-517) hybridizing initially with the previous labeled probes (amino acid residues T9-24) 
and then with additional radiolabeled probes derived from amino acid residues 109-114 and having the 
following sequences (all possible variations, 72-fold degenerate) present as a mixture: 
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15 



20 



3 1 CTA GTA ACA TAA TAA CC 5' 
G G G G G 
T T 

One plasmid pool (pSOD1) contained a cDNA insert of 520bp that hybridized with both probes and after 
colony purification, plasmid DNA was prepared from this clone and sequenced by the method of Maxam 
and Gilbert ( Proc. Natl. Acad. Sci. USA (1977) 74:560-564) with the results shown in Fig. 4. The hSOD 
cDNA clone pSOD1 constitutes the coding region for amino acids 10-153 of hSOD, a single translational 
stop codon and a 3' untranslated region. Therefore, in the expression vector construct, the base sequence 
of the region encoding amino acids 1-9 is derived from the published amino acid sequence of hSOO 
(Jabusch et al.. supra; Barra et al., supra) and synthesized chemically as a part of the variable (inker 
segment (s"ee~"below). Construct~on" "of Plasmid plots - (See Figures 2 and 3) 

Plasmid plotl, containing a hybrid trp-tac ("tac") promoter (DeBoer et al.. Proc. Natl. Acad. Sci. USA - 
(1983) 80:21-2*5) was constructed by gel isolating~the I80bp HgiA-Taql fragment of ptrpLI (Edman et al.. 
Nature (T981) 291:503-506) and the 58bp Hpall-EcoRI fragment from pKB268 (Backman and Ptashne. Cell - 
(1978) 13:65-71), and ligating these fragments to pBR322 digested with Pstl and Eco RI. The resulting 
plasmicfwas used to transform strain D1210 and clones selected for tetracycline resistance. Plasmid plot3 
was constructed by gel isolating the 155bp Fnu 4Ht- Eco RI fragment of plotl containing the tac promoter, 
with the Fnu4HI site being made flush-ended using the Klenow fragment of DNA polymerase I ("pol I K" or 
"pol. KlerO, and the 18bp EcoRI-Pstl polylinker fragment of nAN7 of the following sequence: ■ 

EcoRI Hindi I I 



25 5 • AATTCCCGGGGATCCGTCGACCTGCAGATCTCTAGA 

3 ■ GGGCCCCTAGGCAGCTGGACGTCTAGAGATCTTCGA 

I I 
Sai l Pst l 

30 

These fragments were ligated to gel purified pBR322 digested with Eco RI, flush-ended using pol I K, 
followed by digestion with Pstl and gel purified. This ligation mix.wasused to transform strain D1210, 
selecting on L-agar plates containing 10ag/ml tetracycline. 

Plasmid plots was made by first constructing a plasmid containing the nAN7 polylinker as an EcoRl- 
35 Pvull substitution in pBR322. To do this, plasmid nAN7 was digested with Hindlll, made flush-ended by 
filTTng in with pol I K and a synthetic, self-complementary. Pvu ll linker molecule (d(5'-CCAGCTGG-3 )) 
ligated to the above-modified plasmid nAN7. After digestion with Eco RI and Pvu ll, the resultant 44bp 
polylinker fragment (with 4-base overhangs) was gel isolated and cloned into pBR322 as an EcoRI-Pvull 
substitution. 

Plasmid plot3 was digested with EcoRI and after phenol-chloroform extraction and ethanol precipitation, 
the protruding" -ends were made flush-ended by treatment with S1 nuclease (Palmiter. Biochemistry - 
(1974) 13:3606-3615; Hallewell and Emtage, Gene (1980) 9:27-47). After phenol-chloroform extraction and 
ethanol~precipitation, the DNA was digested with Clal, made~flush-ended by pol I K, and the 237bp fragment 
containing the tac promoter isolated by preparative polyacrylamide gel electrophoresis. This flush-ended tac 
promoter fragment was then inserted at the Pvu ll site of the pBR322 polylinker plasmid (see Fig. 3) and 
clones obtained in which the tac promoter directed transcription towards the ^-lactamase gene of pBR322. 



40 



45 



Construction of plots Derivatives Expressing r-hSOD 

The synthetic DNA molecules F(26), C(16), B(31). D(11), E(13) and 4(24) shown in Fig. t, were 
synthesized by the phosphoramidite method. The single strand 4(24) was prepared by using all four bases, 
at each site where X is indicated. Furthermore, silica was withdrawn from the synthesis of the 24mer, such 
that single-stranded 2lmers. 22mers, and 23mers are obtained in addition to the 24mers. After removal 
from the silica support, the four mixtures are combined in appropriate proportions to provide for equimolar 
amounts of each of the possible single strands. This mixture was treated as a single product in the 
subsequent steps. 

Molecules F(26). C(16), 8(31) and D(11) were mixed together in equimolar amounts and lOug 
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phosphorylated using T4 polynucleotide kinase. After phenol-ether extraction, the additional non- 
phosphorylated synthetic DNA molecules 4(24) and E(13) were added, such that all fragments were 
equimolar. The equimolar mixture contained 13u,g of DNA in 133ul of 0.3x kinase buffer. 

After annealing by cooling at a uniform rate from 70 *C to 20* C over 60min. the single strands were 

5 ligated together with T4 ligase in 200ul ligation mix at 14* C for 4hrs, phenol-chloroform extracted, ethanol 
precipitated and the 5 -ends of 4(24) and E(1 3) phosphorylated using T4 polynucleotide kinase (Maniatis et 
al., supra). Preparative polyacrylamide gel electrophoresis was used to isolate the completely ligated 53bp 
material having 5 • and 3 -overhangs. 

The above purified fragment mixture was then tigated to the 460bp Taq l-Fstl segment of the hSOD 

w cDNA as shown in Fig. 1. This segment was itself constructed by isolating the 454bp Taql-Alul hSOD 
fragment, making it flush-ended pol I K using and inserting it into plots between its Eco RI and Sail sites 
(see Fig. 3) which had been similarly made flush-ended. After preparation of plasmid DNA from this 
recombinant, the 460bp Taq l-Pstl hSOD fragment was isolated by preparative polyacrylamide gel elec- 
trophoresis. After extraction and precipitation, the 515bp fragment resulting from the joining of the synthetic 

ts fragment to the 460bp Taql-Pstl hSOD fragment was filled in with pol I K (525-528bp) and then digested 
with Sail and the resulting 519-522bp hSOD fragment isolated by polyacrylamide gel electrophoresis. This 
fragrnent was then inserted into plots which had been digested with Pvull and Sai l and. then treated with 
alkaline phosphatase. The resulting plasmids were used to transform strain D1210. Recombinants obtained 
after transformation of strain D1210 were selected on L-agar containing lOOng/ml ampicillin to give a set of 

20 clones (designated plotS/SOD) with variable SOD expression. 

r-hSOD Expression and plotS/SOD Plasmid Selection 

25 For analysis of total E. coli proteins by SDS-pofyacrylamide gel electrophoresis, overnight cultures were 
diluted 30-fold into 1ml 6T £broth and grown shaking at 37* C for 90min. to an O.D.sso of about 0.2, IPTG 
(isopropylthiogalactoside) was added to a final concentration of 2mM-and the cultures incubated an 
additional 3hrs. After centrifugation, the cell pellet was resuspended in 50u.l of gel loading buffer (Laemmli, 
Nature (1970) 227:680-685) and lysed by repeating the following procedure 3x: Freezing for 1min.. boiling 

30 for 2min., vortexing for 10sec: 

After electrophoresis resolution (Laemmli, supra) the protein bands were stained' with Coomasie blue 
and the amount of SOD produced" by each clone estimated; these results were then confirmed using 
Western blots with antibody to authentic human SOD. Over three hundred clones were analyzed and 
exhibited levels of SOD expression varying from little or none to amounts estimated to be 5-10% A of the total 

35 soluble cellular protein. Results for six of the over three hundred clones are presented in Table 17 along with 
the particular sequence for DNA molecule 4(24) as determined by the method of Maxam and Gilbert, supra . 

Table 1 



Sequence and Levels of SOD Production in E. coli 




Sequence: 




Clone 


5'-XXXX 


- ATG 


GCX 


ACX 


Approximate Weight 












Percent of Total Protein 


pSODX8 


AACA 




A 


G 


5% 


pSOD11 


GTAT 




T 


G 


10% 


pSODx16 


ACAT 




A 


■ A 


5% 


pSOD5 


AATC 




A 


A 


1 0% 


pSOD16 


ATTT 




T 


T 


10% 


PSOD18 


AACT 




A 


G 


7.5% 



Assays for r-hSOD 

For enzyme assays, 100ml cultures were induced as described above. Soluble cell extracts were 
prepared by vortexing cell pellets !0x20sec. with an equal volume of 0.45-0.5mm diameter glass beads 
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(Braun AG) and an equal volume of assay buffer (50mM * Triscacodylate. pH 8.2, 1mM 
diethylenetriaminepentaacetic acid) in a 1.5ml microfuge tube at 4*C. Ceil debris was pelleted by spinning 
the tube for 1min. in a microfuge (Beckman). The supernatant was dialysed against the assay buffer 
containing 1mM each of zinc sulfate and copper sulfate for 4hrs at 4* C. 

5: SOD assays were performed using the pyrogaltol method (Marklund and Marklund, supra ). The reaction 
mixtures employed 0.2m M pyrogallol in assay buffer and reaction rates were determined over a 5min. 
period using a Hewlett-Packard 8450 spectrophotometer at 420nm. Four different assay samples were 
prepared: soluble E. coH extracts; authentic hSOO; and each of the prior samples pre-incubated with rabbit 
antibody to authentic hSOD. Each sample was incubated in a cuvette for 1min. at 25 *C before adding the 

io pyrogallol and assaying at 25 *C. The antibody samples involved a preincubation of 10min. at room 
temperature in assay buffer with 5ul of antibody. These conditions were found to be sufficient to inactivate 
lOOng of pure hSOD. 

The following Table 2 indicates the results for one of the clones examined (pSODx8): 
is Table 2 





Enzymatic Activity of Human Cu-Zn SOD Produced in E. Coli (strain D1210 (pSODX8)) 




Enzyme Preparation 


Units 


20 




SOD/mg 






Protein 


25 


pure Human Cu-Zn SOD 
pSODX8 protein extract 

pSODX8 protein extract preincubated with rabbit anti-human SOD antibody 
plots protein extract 

plot5 protein extract preincubated with rabbit anti-human SOD antibody 


15.384 
3.017 
685 
470 
485 



These data indicate that approximately 15% of the total soluble cellular protein was hSOD (assuming 
30 that the pure human Cu-Zn SOD used as a reference was fully active). Taken together with the 
electrophoretic data (see above) indicating that 5-10% of total soluble cellular protein migrated as hSOD, it 
appears that a substantial fraction, probably a majority of the hSOD produced is active. 

The correct sequence of the cloned gene was determined by the method of Maxam and Gilbert, supra. 
In addition, the first twelve amino acids at the N-terminus were determined by automated Edman 
35 degradation. The detected sequence of amino acids was as follows: 
ALA-THR-LYS-ALA-VAL-(CYS)-VAL-LEU-LYS-GLY-ASP-GLY- 

The first ALA residue detected was present at a molar concentration approximately equal to that of the input 
peptide indicating the absence of a blocked amino terminus. The CYS residue was not detected by the 
method of amino acid analysis used, but its presence was inferred from the nucleotide sequence. 

40 Thus, the (N-formyl-) methionine was removed from the* bacterial expression product and; the material* 
had the correct amino acid sequences, i.e. identical to that reported for cytoplasmic hSOD residues 1-10, 
but the N-terminal ALA residue was not acetylated. Furthermore, the polypeptide made in E. coli migrated 
more slowly than the natural protein in 1% agarose gel (pH 8.6) electrophoresis which detects" differences in 
charge (Corning Universal electrophoresis film, stained according to Clausen, Immunochemical Technique, 

45 p. 530-531). also indicating lack of acetylation. In addition, analysis of tryptic peptides of both the bacterial 
hSOD polypeptide and the, purified, authentic (acetylated) natural protein revealed that all tryptic peptides 
were identical, except the bacterial N-terminal peptide which migrated differently, i.e., with a charge 
expected for a peptide lacking the N-acetyl group. 

50 

Expression in Yeast 

For transfer of the r-hSOD gene to a yeast vector, the plot5/SOD plasm id clones were screened for an 
Ncot restriction site at the 5 -end of the coding region. For Those plasmids where the variable nucleotides 
55 present 5 to the ATG initiation codon are CC, the sequence CCATGG provides an Ncol site. Clones were 
screened, and one was selected and designated phSOD. 

The plasmid phSOD was digested with Nco l and Sail and a 550bp fragment obtained, which included 1 
nucleotide untranslated at the s'-terminus and theTntire coding region for hSOD. pPGAP (a yeast 
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expression vector carrying the GAP promoter, prepared as described below) was digested with Ncol and 
Sail followed by treatment with alkaline phosphatase, and the Satl- Nco l fragment substituted for the Nco l- 
Sall fragment in pPGAP to provide PPGAPSOD. Bam HI digestion of pPGAPSOD resulted in a 2kb fragment 
which was gel isolated and inserted into the BamHI site of pCl/1 and pC1/1 GAL4/370, to yield plasmids 

5 pCUlGAPSOD and pCl/1 GALGAPSOD. respectively. 

Plasmid pCl/1 is a derivative of pJDB219 (Beggs, Nature - (1978) 275:104) in which the region 
corresponding to bacterial plasmid pMB9 in pJDB219 was replaced by pBR322 in pCl/1. For preparing an 
expression vector having secretory and processing signals, see U.S. application serial no. 522.909. Plasmid 
PC1/1GAL4/370, a regulatable yeast expression vector containing the GAL1/GAL10 regulatory region 

io (controlled by the GAL4 gene expression product) is prepared as described below. 

Plasmids pC1/lGAPSOD and pC1/1 GALGAPSOD were transformed into yeast strain 2150-2-3 (available 
from Lee Hartwell, University of Washington) as described previously (Hinnen et at. Proc. Natl. Acad. Sci. 
USA (1978) 75:1929-1933), with the results of expression set forth in the following Table 3. 

15 Table 3 



Expression of Human SOD in Yeast Strain 
2150 


Plasmid 


Carbon 
Source 


SOD 2 ug/mg 
. protein 


pd/1 


g, L« 


0 


PC1/1GAPSOD 




148 


pC1/1 GALGAPSOD 


g. l 


0.4 


gal 


68 



1 AH cultures grown in Minus Leucine media with 2% lactic acid, 3% 
glycerol with or without 2% galactose to late log or early stationary 
phase. 

2 Determined by RIA. 



hSOD levels were measured using a standard radioimmunoassay with, iodinated authentic hSOD as 
35 standard. Constitutive synthesis from the GAP promoter leads to very high levels of hSOD production, of 

the order of 10-30% of the total cell protein. The induction with galactose works almost as well, yielding 

about 7% of the cell protein as hSOD. 

When hSOO is produced at these high levels, it is usually necessary to provide zinc and copper ion to 
. the product protein as a- prosthetic group in order to recover full enzymatic, i.e., catalytic, activity, e.g., by ' 
40 dialysis against 1mM solutions of both zinc and copper sulfate.Alternativefy, zinc and/or copper ion may be 

included in ten growth media; this method also provides a means of selecting for strains producing high 

levels of hSOD and/or avoiding the loss of plasmid vectors expressing hSOD in otherwise non-selective 

media. 

45 

Construction of pPGAP 

pGAP1, a plasmid prepared by insertion of a Hindlll fragment containing the GAPDH gene GAP49 
(Holland and Holland, J. Biot. Chem. (1979) 254 :5466 : 5474) inserted in the Hindlll site of pBR322, was 
so digested with Hinfl and~a 500bp fragment isolated. The fragment was resected with Bal31 to remove about 
50 or 90bp, followed by ligation with HindIM linkers and digestion with Hindlll. pBR322 was digested with 
Hindlll, followed by treatment with alkaline phosphatase and. the about 450 or 410bp fragment inserted to 
provide pGAP 128. 

PGAP128 was digested with ' Hindlll, the fragment made blunt-ended with the Klenow fragment and 
55 dNTPs and the resulting 450bp fragment isolated by gel electrophoresis. This fragment was inserted into 
Smal digested plots, which had been treated with alkaline phosphatase, to provide plasmid plot5pGAP128, 
which contained about -400 to + 27bp of the GAPDH promoter and coding region. 

Yeast expression vector pPGAP having a polyrestriction site linker between the GAPDH terminator and 
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short promoter region was prepared as follows. Plasmid plot5pGAP128 was digested with BamHI and Taql 
to yield an approximately 390bp BamHI-Taql fragment having the -400 to -26bp of the GAPDH promoter. 
The Bam HI' Taq l fragment was ligated to a synthetic fragment containing -25 to -1 bp of the GAPDH 
promoter and several restriction sites including Ncol and having the following sequence: 

CGA.TA, (CA) ^TA-CA-CACCATG-A-T-CGT-AG- 

T^AT^ (GT) ^AT^GT^GTGGTAC^T^A^GCA^TC^AGCT 



to provide a BamHI-Sall fragment, which was digested with BamHI and Sail and used to replace the BamHI- 
Sall fragment of Bam HI-Sall digested pBR322 treated with alkaline phosphatase. After ligation, the plasmid 
pGAPNRS was obtained which was digested with BamHI and Sail to provide a 400bp BamHI-Sall fragment 
which was gel isolated. This fragment was ligated to an aboutlJOObp Sall-BamHI fragment containing the 
GAPDH terminator region and a short segment of 3 coding region andThe resulting 1.4kb BamHI-BamHI 
fragment digested with Bam HI. The Salt-Bam HI GAPDH terminator fragment was obtained by Sail and 
Bam HI digestion of pGAP2, a plasmid prepared by insertion of an about 3.3kb BamHI fragment containing 
the GAPDH gene GAP49 (Holland and Holland, supra ) into the Bam HI site of pBR322. Plasmids pGAP2 and 
PGAP1 were obtained as follows: A yeast gene library was prepared by inserting fragments obtained after 
partial digestion of total yeast DNA with restriction endonuclease . Sau3A in lambda-phage . Charon 28 
(Blattner et al., Science (1977) 196 :161-169). The phage library was screened with DNA complementary to 
the yeast GAPDH mRNA and the yeast GAPDH gene from one of these clones was subcloned as either an 
about 3.3kb Bam HI fragment in the Bam HI site of pBR322 (pGAP-2) or as an about 2.1 kb Hindlll fragment 
in the Hindlll site of pBR322 (pGAP-1). 

pBR322 was digested with Eco RI and San, the termini blunt-ended and ligated to Bam HI linkers, 
followed by Bam HI digestion and the Bam HI- Bam HI 3.8kb fragment gel isolated, recircularized by setf- 
ligation. cloned and designated pBRAR1-Sal. The 1.4kb BamHI-BamHI fragment was inserted into the 
Sam HI-digested. alkaline phosphatase treated pBRAR1-Sal vector to provide the plasmid pPGAP of about 
5.3kb with the orientation in the opposite direction of the amp r . 



Construction of GAL regulated containing plasmids. 



35 



40 



45 



50 



55 



Plasmid pLGSD5 is prepared as described in Guarente et al., (1982) supra. The plasmid was 
manipulated as follows: After restriction with Xho l. the overhangs were" filled in with the Klenow fragment of 
DNA polymerase I ("Klenow fragment"), ligated with EcoRI linkers (GGAATTCC) and then completely 
digested with Eco RI and Sau 3A to provide a 370bp fragment which was isolated by gel electrophoresis and 
included the intergenic sequence between GAL1 and GAL10 genes of yeast, and provides for the GAL4 
.regulation sequence of the GAL1 and GAL10 genes. 

This fragment was inserted into pBR322 which had been completely digested with EcoRI and BamHI, 
followed by treatment with alkaline phosphatase to prevent oligomerization resulting in plasmid pBRGAL4. 

Plasmid pBRGAL4 was completely digested with Sau3A. the overhangs filled in with the Klenow 
fragment, and the resulting blunt-ended fragment ligated with Sail linkers (CGTCGACG), followed by 
digestion with Sail and Xho l. The resulting 370bp fragment waiTsolated by gel electrophoresis. This 
fragment has the original 370bp yeast GAL4 regulator sequence with Xhol and Sail termini. 

The fragment was then cloned in the plasmid plots, plots was prepared bylnserting the 40bp poiylinker 
fragment of the following sequence 

EcoRI BamHI Bgl ll Xba l Hin dlll. 

r r "n" n r 

5 ' AATTCCCGGGGATCCGTCGACCTGCAGATCTCTAGAAGCTTCAG 
3 1 GGGCCCCTAGGCAGCTGGACGTCTAGAGATCTTCGAAGTC 

I II I 

Sma l Sai l Pst I PvuII 

into pBR322 as an Eco RI- Pvu ll substitution followed by insertion of the trp-lac promoter (Russell and 
Bennett. Gene (1982) 20:231-245) into the Pvuil site with transcription oTiented toward the poiylinker 
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sequence plot5 was completely digested with Sail, followed by treatment with alkaline phosphatase and the 
370bp fragment inserted into plots to provide plasmid-plot5GAL4/370. This plasmid was then completely 
digested with BamHl and Sail to reproduce the individual fragment extended by 6bp of the polyimker 
fragment. This "fragment was then ligated into pCl/1. which had been completely digested with BamHl and 

s Sail followed by treatment with alkaline phosphatase to prevent recirculation. The resulting plasmid was 
designated pCl/lGAL4/370. The BamHt-Sall fragment is located in the pBR322 portion of the vector pCl/1. 

The phSOD polypeptide made in yeast was shown to be identical to the natural human protein. 
Migration of hSOD made in yeast was identical to the protein in both polyacrylamide gel electrophoresis 
(with and without sodium dodecy I sulfate) and in agarose gel electrophoresis (see above). Moreover, when 

io highly purified yeast polypeptide was subjected to twelve cycles of Edman degradation; the sequence was 
the same as that reported for the human protein (residues 1-10) made in E. coli set forth above. The level of 
detection, however, was only 5 to 10% of the expected level on a molar basis relative to protein input. This 
reduced detection level indicated that the N-terminal amino acid was blocked, i.e., probably acetylated. The 
result was confirmed by comparative analysis of tryptic peptides derived from yeast-produced hSOD and 

rs authentic acetylated human material which showed that all the expected trypic proteins were tdentical in the 
two samples including the N-terminal one, thus indicating the presence of acetylated N-terminal ALA m the 
yeast expression product. 

20 Isolation of the Human SOD Gene 

To isolate the human SOD gene, a bacteriophage lambda library representative of the human genome 
(R Lawn et al. (1978) Cell 15:1157-1174) was screened with a radiolabeled DNA probe made from the 
human SOD cDNA. One million phage plaques were screened, and 13 positively hybridizing plaques>;were 

25 purified. Restriction endonuclease analysis of the phage DNAs indicated that there are at least 5 different 
genes suggesting that there are other SOD related genes and gene products. One candidate for. such a 
gene is the recently discovered extracellular Cu/Zn SOD (S. Markiund. (1982) Proc. Natl. Acad. SciaUSA 
79-7634-7638) To distinguish the authentic cytoplasmic Cu/Zn SOD gene from the related ones we; used 
synthetic DNA probes 5 -AATGCTTCCCCACACC-3' and 5 -CTC AGTTAAAATGTCTGTTTG-3 correspond; 

30 ing to amino acid residues 19-26 and nucleotides 193-213 3 from the terminator codon m the 3 
untranslated region, respectively. Only one of the 13 genomic DNAs hybridized with these probes, 
indicating that it is the authentic human cytoplasmic SOD gene. This was confirmed by DNA sequence 
analysis of the N-terminal region, as shown in Fig. 5 where the amino acid sequence determined by protein 
sequencing is confirmed. This also shows that no preprotein exists for SOD since an in-frame termination 

35 codon exists nine nucleotides 5' from the initiator methionine codon. As shown in Figure 5. the human 
Cu/Zn SOD gene contains intervening sequences. The map of the SOD gene shown in Figure 6 indicates 
that there is more than one intervening sequence. 

Although the foregoing invention has been described in some detail by way of illustration and example 
for purposes of clarity of understanding, it- will be obvious that certain changes and modifications may be 

40 practiced within the scope of the appended claims. 

E coli strain D1210 (pSODX8) was deposited at the A.T.C.C. on September 27, 1983 and given 
Accession" No. 39453. Yeast strain 2150-2-3 ( P C1/1 GAPSOD) and E. coli strains D1210 (pSOD11) and 
D1210 (pS20R) were deposited at the A.T.C.C. on May 9, 1984, and given Accession Nos. 20708. 39679 
and 39.680, respectively. 
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Claims 



1 A new DNA construct capable of expression of human superoxide dismutase in an unicellular 
microorganism and containing in downstream order of transcription a promoter, a ribosomal binding site, an 
initiation codon, a structural gene and a transcription terminator, characterized in that the structural gene 
encodes a human superoxide dismutase structural gene or alleles thereof, and said structural gene is in 
reading frame with said initiation codon having a stop codon at the 3 -terminus. 

2. A DNA construct according to claim 1 . wherein the nucleotide composition between said ribosomal 
binding site and the initiation codon, and in the two amino-terminal codons (Ala-Thr) of the human 
superoxide dismutase structural gene, is varied to provide an expression level of at least 5%. 

3. The DNA construct according to claim 2 wherein said nucleotide composition is 5 - 
AACAATGGCAACG-3'. 
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4. The DNA construct according to claim 2 wherein said nucleotide composition is 5 # - 
ACATATGGCAACA-3 . 

5. A DNA construct according to claim 1 , wherein the nucleotide composition between said ribosomal 
binding site and the initiation codon. and in the two amino-terminal codons (Ala-Thr) of the human 
superoxide dismutase structural gene, is varied to provide an expression level of at least 7.5%. 

6. The DNA construct according to claim 5 wherein said nucleotide composition is 5'- 
AACTATGGCAACG-3 . 

7. A DNA construct according to claim 1, wherein the nucleotide composition between said ribosomal 
binding site and the initiation codon, and in the two amino-terminal codons (Ala-Thr) of the human 
superoxide dismutase structural gene is varied to provide an expression level of at least 10%. 

8. The DNA construct according to claim 7 wherein said nucleotide composition is 5*- 
GTATATGGCTACG-3 . 

9. The DNA construct according to claim 7 wherein said nucleotide composition is 5- 
AATCATGGCAACA-3 . 

10. The DNA construct according to claim 7 wherein said nucleotide composition is 5*- 
ATTTATGGCTACT-3 . 

1 1 A DNA construct according to claim 1 , wherein said unicellular microorganism is a bacterium. 

12. A DNA construct according to claim 1, wherein said unicellular microorganism is a yeast. 

13. A DNA construct according to claim 12, wherein said yeast is S. cerevisiae. 

14. A DNA construct according to claim 1. joined to a replication system for extrachromosomal 
replication and maintenance. 
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Translation of hsod 



10 20 



atv asp gly pro val gin gly lie Me asn phe glu g!n lys glu 
GGC GAC GGC CCA GTG CAG GGC ATC ATC AAT TTC GAG CAG AAG GAA 
*Sau 96 I *Taql 



ser asn gly pro val lys vai trp gly ser He lys gly leu thr 

AGT AAT GGA CCA GTG AAG GTG TGG GGA AGC ATT AAA GGA CTG ACT 
*Sau 96 I 

40 50 

olu qW leu his gly phe his val his glu phe gly asp asn thr 

GAA GGC CTG CAT GGA TTC CAT. GTT CAT GAG TTT GGA GAT AAT ACA 



60 

ala gly cys thr ser ala gly pro his phe asn pro leu ser arg 
GCA GGC TGT ACC AGT GCA GGT CCT CAC TTT AAT CCT CTA TCC AGA 

70 80 

lys his gly gly pro lys asp glu glu arg his val gly asp leu 
AAA CAC GGT GGG CCA AAG GAT GAA GAG AGG CAT GTT GGA GAC TTG 

90 

oIy asn val thr ala asp lys asp gly val ala asp val ser lie 
GGC AAT GTG ACT GCT GAC AAA GAT GGT GTG GCC GAT GTG TCT ATT 

100 110 

qlu asp ser val He ser leu ser gly asp his cys He lie gly 
GAA GAT TCT GTG ATC TCA CTC TCA GGA GAC CAT TGC ATC ATT GGC 

120 

arg thr leu val val His glu lys ala asp asp leu gly lys gjy 
CGC ACA CTG GTG GTC CAT GAA AAA GCA GAT GAC TTG GGC AAA GGT 

130 140 

qlv asn glu glu ser thr lys thr gly asn ala gly ser arg leu. 
GGA AAT GAA GAA AGT ACA AAG ACA GGA AAC GCT GGC AGT CGT TTG 

150 153 
ala cys alv val Tie gly He ala gin OC 

^CT TGT GGT GTA ATT GGG ATC GCC CAA TAA ACATTCCCTTGGCTGTAGTCT 

GAGGCCCCTTAACTCATCTGTTATCCTGCTAGCTC- 

«Alul 



Translated Mol. Weight = 14893.03 



FIG. 4 
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